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(54) Expression and purification of an 
HTLV-1 1 1 GAG/ENV gene protein 



(57) The present invention provides 
methods and compositions for the 
cloning and expression of plasmids 
bearing genes encoding a novel protein 
derived from HTLV-III. This protein, 
which is called the gag/env protein and 
which contains antigenicdeterminants 
from both the core and envelope 
proteins of HTLV-III, can bepurifiedto 
homogeneity and used as the basis for 
diagnostic tests to detect the presence of 
antibodies against viruses associated 
with AIDS or the viruses themselves in 
, human sera and other biological fluids. 
The gag/env protein may also be 
formulated for use as a vaccine for 
protection against AIDS through 
prophylactic immunization. 
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Clal 

ATG AAT AGA ATT CGG ATC CAT CGA TGG GAA AAA ATT CGG TTA AGG CCA GGG GGA 
MET Asn Arg lie Arg He His Arg Trp Glu Lys He Arg Leu Arg Pro Gly Gly 

15 gag > 

AAG AAA AAA TAT AAA TTA AAA CAT ATA GTA TGG GCA AGC AGG GAG CTA GAA CGA 
Lys Lys Lys Tyr Lys Leu Lys His He Val Trp Ala Ser Arg Glu Leu plu Arg 

TTC GCA GTT AAT CCT GGC CTG TTA GAA ACA TCA GAA GGC TGT AGA CAA ATA CTG 
Phe Ala Val Asn Pro Gly Leu Leu Glu Thr Ser Glu Gly Cys Arg Gin lie Leu 

GGA CAG CTA CAA CCA TCC CTT CAG ACA GGA TCA GAA GAA CTT AGA TCA TTA TAT 
Gly Gin Leu Gin Pro Ser Leu Gin Thr Gly Ser Glu Glu Leu Arg Ser Leu Tyr 

AAT ACA GTA GCA ACC CTC TAT TGT GTG CAT CAA AGG ATA GAG ATA AAA GAC ACC 
Asn Thr Val Ala Thr Leu Tyr Cys Val His Gin Arg lie Glu He Lys Asp Thr 

AAG GAA GCT TTA GAC AAG ATA GAG GAA GAG CAA AAC AAA AGT AAG AAA AAA GCA 
Lys Glu Ala Leu Asp Lys He Glu Glu Glu Gin Asn Lys Ser Lys Lys Lys Ala 

CAG CAA GCA GCA GCT GAC ACA GGA CAC AGC AGT CAG GTC AGC CAA AAT TAC CCT 
Gin Gin Ala Ala Ala Asp Thr Gly His Ser Ser Gin Val Ser Gin Asn Tyr Pro 

ATA GTG CAG AAC ATC CAG GGG CAA ATG GTA CAT CAG GCC ATA TCA CCT AGA ACT 
He Val Gin Asn He Gin Gly Gin MET Val His Gin Ala He Ser Pro Arg Thr 

TTA AAT GCA TGG GTA AAA GTA GTA GAA GAG AAG GCT TTC AGC CCA GAA GTA ATA 
Leu Asn Ala Trp Val Lys Val Val Glu Glu Lys Ala Phe Ser Pro Glu Val He 

CCC ATG TTT TCA GCA TTA TCA GAA GGA GCC ACC CCA CAA GAT TTA AAC ACC ATG 
Pro MET Phe Ser Ala Leu Ser Glu Gly Ala Thr Pro Gin Asp Leu Asn Thr MET 

CTA AAC ACA GTG GGG GGA CAT CAA GCA GCC ATG CAA ATG TTA AAA GAG ACC ATC 
Leu Asn Thr Val Gly Gly His Gin Ala Ala MET Gin MET Leu Lys Glu Thr He 

AAT GAG GAA GCT GCA GAA TGG GAT AGA GTA CAT CCA GTG CAT GCA GGG CCT ATT 
Asn Glu Glu Ala Ala Glu Trp Asp Arg Val His Pro Val His Ala Gly Pro He 

GCA CCA GGC CAG ATG AGA GAA CCA AGG GGA AGT GAC ATA GCA GGA ACT ACT AGT 
Ala Pro Gly Gin MET Arg Glu Pro Arg Gly Ser Asp He Ala Gly Thr Thr Ser 

ACC CTT CAG GAA CAA ATA GGA TGG ATG ACA AAT AAT CCA CCT ATC CCA GTA GGA 
Thr Leu Gin Glu Gin He Gly Trp MET Thr Asn Asn Pro Pro He Pro Val Gly 

GAA ATT TAT AAA AGA TGG ATA ATC CTG GGA TTA AAT AAA ATA GTA AGA ATG TAT 
Glu He Tyr Lys Arg Trp He He Leu Gly Leu Asn Lys He Val Arg MET Tyr 

AGC CCT ACC AGC ATT CTG GAC ATA AGA CAA GGA CCA AAA GAA CCT TTT AGA GAC 
Ser Pro Thr Ser He Leu Asp He Arg Gin Gly Pro Lys Glu Pro Phe Arg Asp 

TAT GTA GAC CGG TTC TAT AAA ACT CTA AGA GCC GAG CAA GCT TCA CAG GAG GTA 
Tyr Val Asp Arg Phe Tyr Lys Thr Leu Arg Ala Glu Gin Ala Ser Gin Glu Val 
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FIGURE 2 (continued) 

AAA AAT TGG ATG RCA GAA ACC TTG TTG GTC CAR ART GCG AAC CCA GAT TOT AAG 
JJJ Asn Trp MET Thr Glu Thr Leu Leu Val Gin Rsn Ala Rsn Pro Asp Cys Lys 

ACT ATT TTA AAA GCA TTG GGA CCA GCG GCT ACA CTA GAA GAA ATG ATG ACA GOV 
S ne Leu Lys Ala Leu Gly Pro Ala Ala Thr Leu Glu Glu MET MET Thr Ala 

TGT CRG GGR GTR GGA GGR CCC GGC CRT AAG GCR RGR GTT TTG GCT GAA GCA ATG 
Cys Gin Gly Val Gly Gly Pro Gly His Lys Ala Arg Val Leu Ala Glu Ala MET 

AGC CAA GTA ACA RRT RCR GCT RCC RTR ATG ATG CAG AGA GGC AAT TTT AGG AAC 
lev G?n val Thr Asn Thr Ala Thr lie MET MET Gin Arg Gly Asn Phe Arg Asn 

CAA AGA AAG ATG GTT AAG TGT TTC ART TGT GGC AAA GAA GGG CAC ACA GCC AGA 
G?n Arg Lys MET Val Lys Cys Phe Asn cys Gly Lys Glu Gly His Thr Ala Arg 

AAT TGC AGG GCC CCT AGG ARR RRG GGC TGT TGG AAA TGT GGA AAG GAA GGA CAC 
™l cyl Arg Ala Pro Arg Lys Lys Gly Cys Trp Lys Cys Gly Lys Glu Gly Hxs 

Bglll 

CAA ATG AAA GAT TGT ACT GAG AGA CAG GCT AAT TTT TTA GGG AAG ATC TTC AGA 
SE £ Ss Asp cys The Glu A, g Gin Ala Asn Phe LeuGly LysUe Phe_Arg 

CCT GGA GGA GGA CAT ATG AGG GAC AAT TGG AGA AGT GAA TTA TAT AAA TAT AAA 
So T£ S3 Sy Asp MET Arg Asp Asn Trp Arg Ser Glu Leu Tyr Lys Tyr Lys 

GTA GTA AAA ATT GAA CCA TTA GGA GTA GCA CCC ACC AAG GCA AAG AGA AGA GTG 
SS SS £2 lie Glu Pro Leu Gly Val Ala Pro Thr Lys Ala Lys Arg Arg Val 

CTT CAG AGA GAA AAA AGA GCA GTG GCA GCA GGA AGC ACT ATG GGC GCA GCG TCA 

vS G?n Arg" A?g Ala Val Ala Ala Gly Ser Thr MET Gly Ala Ala Ser 

ATG ACG CTG ACG GTA CAG GCC AGA CAA TTA TTG TCT GGT ATA GTC CAG CAG CAG 
kS ihr £eu Thr val Gin Ala Arg Gin Leu Leu Ser Gly lie Val Gin Gin Gin 

AAC AAT TTG CTG AGG GCT ATT GAG GCG CAA CAG CAT CTG TTG CAA CTC ACA GTC 
™ ™ Leu Su Arg Ala lie Glu Ala Gin Gin His Leu Leu Gin Leu Thr Val 

TGG GGC ATC AAG CAG CTC CAG GCA AGA ATC CTC GCT GTC GAA AGA TAC CTA AAG 
1% S£ iTe £y"s Gin Leu Gin Ala Arg He Leu Ala Val Glu Arg Tyr Leu Lys 

GAT CAA CAG CTC CTG GGG ATT TGG GGT TGC TCT GGA AAA CTA CTT TGC ACC ACT 

Sp £n Gin Su Leu Gly lie Trp Gly cys Ser Gly Lys Leu Leu Cys Thr Thr 

GCT GTG CCT TGG AAT GCT AGT TGG AGT AAT AAA TCT CTG GAA CAG ATT TGG AAT 
Sa SS Pro Trp Sn Ala Ser Trp Ser Asn Lys Ser Leu Glu Gin lie Trp Asn 

Hindlll 

CAC ACG ACG TGG ATG GAG TGG GAC AGA GAA ATT AAC AAT TAC ACA AGC TTT AAT 
Ss JS Trp MET Glu Trp Asp Arg Glu lie Asn Asn Tyr Thr Ser Phe Asn 



GCG GTA GTT TAT CAC AGT TAA 
Ala Val Val Tyr His Ser . 
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SPECIFICATION 

Expression and purification of an HTLV-III gag/env gene protein 

5 5 
The present Invention relates to a protein designated gag/env, which comprises the core protein (gag) and 
the envelope protein (env) of the HTLV-III virus, the etiologic agent for acquired immune deficiency 
syndrome (AIDS). This protein is produced through organic synthetic methods orthrough the use of 
recombinant DNA techniques in which the requisite gene sequences are inserted by means of a suitable DNA 

10 vectorintoacompatibleunicellularhostorganism. 10 
This invention further relates to the isolation and purification of the gag/env protein and to methods forthe 
detection of AIDS antibodies or viruses in human sera or other biological fluids and forthe 
immunoprophylactic protection of human beings against AIDS, based on the use of the protein. 
In 1 985 nearly 8,000 people were diagnosed as having AIDS, and the number has been steadily increasing. 

15 Fifteen thousand more cases are expected to be diagnosed in 1 986, and the number of cases may well double 1 5 
again in 1987 [New York Times Magazine, March 2, 1986, p. 42]. AIDS has been characterized by the onset of 
severe opportunistic infections secondary to an effect on the body's immune system [Gottlieb et al., New 
Eng. J. Med. 305: 1425 (1981 )]. The disease has been found in male homosexuals, patients receiving blood 
products, intravenous drug addictsand individuals originating from Haiti and Central Africa [Piotetal., 

20 Lancet 77:65(1984)]. 20 
The causative agent was suspected to be of viral origin because the epidemiological pattern of AIDS was 
consistent with that of a transmissible disease. At leastthree retroviruses have been isolated from cultured 
T-cells of several patients with AIDS, or from white blood cells of personsatriskforthedisease. Anovel 
human retrovirus called lymphadenopathy-associated virus (LAV) was discovered, and its properties were 

25 consistent with an etiological role in AIDS. That virus was isolated from a patient with lymphadenopathy and 25 
hence the name [Montagnier et al., in Human T-Cell Leukemia/Lymphoma Virus, R.C. Gallo et al. eds., Cold 
Spring Harbor Laboratory, pp. 363-370 (1984)]. 

Other human retroviruses, specifically two subgroups of the human T-cell 
leukemia/iymphoma/lymphotropic virus, types I [Poiesz et al., Proc. Natl. Acad. Sci. U.S.A. 77:7415(1980)] 

30 and III [Popoviceta1.,Science224:497(1984)] have also been isolated. Still another virus, calledthe 30 
AIDS-associated retrovirus (ARV), has been proposed as the causative agent [Levy et al., Science 225:840 
(1984)]. Both the HTLV-llland ARV retroviruses display biological and seroepidemiological properties similar 
to LAV [Levy et al., supra, Popovic et al., supra]. Thus, at least three retroviruses have been postulated to be 
the etiologic agent or AIDS: LAV, ARV, and HTLV-III. For this application, these viruses will be collectively 

35 referred to asthe AIDS viruses. Because HTLV-III istheprototypic virus for this group, it will be understood 35 
that the term "antigenic determinant corresonding to the sequences of a protein of an HTLV-III vjrus" actually 
refers to the sequences of the proteins of any of the AIDS viruses. 

One reason forthe difficulty in determining the true etiologic agent of AIDS was cross-reactivity of various 
retroviral antigens with serum samples from AIDS patients. For example, serum samplesf rom AIDS patients 

40 have been shown to react with antigens of both HTLV-I [Essex et al., Science 220:859 (1 983)] and HTLV-III 40 
[Sarngadharanetal., Science 224: 506 (1984)]. Envelope gene products of HTLV demonstrated antigenicities 
that were cross-reactive with antibodies in sera from adult T-cell leukemia patients [Kiyokawaetai., Proc. 
Natl. Acad. Sci. U.S.A. 57:6202 (1 984)]. AdultT-cell leukemias (ATL) differfrom acquired immune deficiency 
syndrome (AIDS) in that HTLV-I causes T-cell malignancies that are characterized by the uncontrolled g rowth 

45 of T-cells. In AIDS,instead of cell growth there iscell death. In factthiscytopathic characteristic of HTLV-III 45 
was critical to ultimately determining the specific retroviral origin of the disease. 

The etiologic agent of AIDS was isolated by the use of immortalized human neoplasticTcell lines (HT) 
infected with the cytopathic retrovirus characteristic of AIDS, isolated from AIDS afflicted patients. 
Seroepidemiological assays using this virus showed a complete correlation between AIDS and the presence 

50 of antibodiesto HTLV-III viral antigens [Montagnier etal., supra; Sarngadharanetal., supra; Schupbachet 50 
al., Science 224:503 (1984)]. In addition, nearly 85% of patients with lymphadenopathy syndrome and a 
significant proportion of asymptomatic homosexual men in AIDS endemic areas were also found to carry 
circulating antibodiesto HTLV-III. Taken together, these data implicate HTLV-III asthe principal etiologic 
agent for AIDS. 

55 Until the successful culturing of the AIDS virus using theH-9ceII line, the env AIDS protein of the AIDS 55 
virus had not been isolated, characterized or synthesized. This in major part was dueto the fact that the virus 
is cytopathic, and isolation of the virus was thus not possible [Popovic et al., supra]. Once a human T-cell line 
that was resistant to the cytopathic effects of the virus was discovered, however, the molecular cloning of 
AIDS proviral DNA could be carried out 

60 The need for sensitive and rapid methods for the diagnosis of AIDS in human blood and in other biological 60 
fluids and for a method to prevent the disease by vaccination is very great. Virtually all of the assays and tests 
presently available are fraught with errors. In fact, the Centerfor Disease Control (CDC) has indicated that 
presently available tests should be used solelyior screening units of blood forthe presence of antibodiesto 
HTLV-III. The CDC has even gone further by stating that the presently available enzyme-linked 

65 immunosorbent assay (EUSA) tests should not be used forthe general screening of high risk populations or 65 
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as a diagnostictest for AIDS [Federal Register 50(48): 9909, March 12, 1985]. 

The errors in previously used AIDS tests have been traced to the failure to use a specific antigenic protein of 
the etiologicagent for AIDS. The previously used proteins were derived from a viral lysate. Since the lysate is 
made from human cells infected with the virus, i.e. the cells used to grow the virus, the lysate will contain 
5 human proteins as well as viral proteins. Thus preparation of a pure antigen of viral protein is very difficult. 5 
The antigens used until now have thus produced both false positive and false negative results [Budiansky, 1 
Nature 3 12: 583 (1 984)]. The errors caused by the use of such lysate proteins/peptides could be avoided by 
using a composition for binding AIDS antibodies which is substantially free of the non-AIDS specific 
proteins. Compositions of substantially pure AIDS envelope and core protein can be used as antigens. 
10 BoththeenvelopeandcoreproteinsofHTLV-lllhavecpnservedantigenicdeterminantswhichwould 10 
permit their use to screen for, diagnose and/or provide protection by vaccination against the AIDS viruses. 
And individuals that have been exposed to HTLV-lll and who may thus be at risk to contract AIDS or who have 
the disease can be identified by the presence in their blood of antibodies to the viral core protein (gag) and/or 
the envelope protein (env), [Samgadharan et al., Science 224: 506 (1984)]. 
1 5 The availability of a reliable and sensitive test forthe presence in blood or in other biological fluids of the 1 5 
AIDS virus itself, or of particles therefrom, is also important. Groopman et al. [Blood 66:742 (1985)] have 
reported that antibodies against AIDS viruses are not always present in the blood of AIDS victims. Groopman 
etal. examined one patient with AIDS and another with related disorders (ARC) from whom blood samples 
were taken which were antibody negative butfrom which HTLV-lll could be cultured. 
20 Recombinant DNA technology has recently been applied to the AIDS problem. The molecular cloning and 20 
expression of the env gene from HTLV-lll has been reported [Crow! etal., Cell 47:979 (1985); Chang etal., 
Biotechnology 3:905 (1 985)]. Dowbenko et al. [Proc. Natl. Acad. Sci. U.S.A. 52:7748 (1 985)] have expressed 
the HTLV-lll core protein in E. coli. 
Through the use of such genetic engineering techniques, purified viral antigens that are safe, reliable and 
25 lesscostly to produce have become available. Even more advantageous, however, would be the availability 25 
of a viral protein having antigenic determinants that are present on both the env and the gag proteins. Such a 
protein would be an exceptionally powerful tool forthe detection of AIDS antibodies, and it could serve asthe 
basis for a variety of sensitive diagnostic tests and for a possible vaccine against the AIDS virus. 
Therefore a novel protein, designat gag/en v, having an amino acid sequence corresponding to at least one 
30 antigenicdeterminant of a HTLV-lll gag protein and of a HTLV-lll env protein has been provided which 30 
permits to screen for, diagnose and/or protect by vaccination against the AIDS virus. The novel gag/env 
protein can be represented by all or part of the amino acid sequence shown in Figure 2 or afunctional 
equivalentthereof. 

The invention further provides the requisite DNA sequences coding forthe gag/env polypeptide of the 

35 present invention, recombinant vectors containing such DNA sequences, and unicellular organisms useful in 35 
the production through recombinant DNA technology of the gag/env protein of the present invention as well 
as processes forthe manufacture of such DNA sequences, recombinant vectors and unicellular organisms. In 
addition methods are described for the expression and isolation of the gag/env protein of the present 
invention. The gag/env protein thus produced may be utilized by the methods of this invention for a number 

40 of important immunological processes. 

Through use as a diagnostic reagent, the gag/env protein can be used to detect the presence of antibodies 
against AIDS viruses in human sera or in other biological fluids such astears, semen, vaginal secretions and 
salvia. Because the protein can be prepared in a homogeneous form, problems of non-specific reactions 
which have plagued the use of diagnostic reagents based upon relatively crude HTLV-lll viral protein isolates 

45 in the past are eliminated. 

Used as an immunogen, the gag/env protein can be employed to produce antibodies in animals againsttne 
antigenic determinants contained therein. Such antibodies can in turn be used, in conjunction with the 
gag/env protein which has been suitably labeled, in a radioimmunoassay (RIA) or enzyme-linked 
immunosorbent assay (EUSA) to detect the presence of HTLV-lll viruses or particles therefrom in human 

50 serum or in other biological fluids such as tears, semen, vaginal secretions and saliva. Particles (or 50 
fragmentsjthatcouldbedetectedbythesemethodsincludeofcoursepiecesoftheviralcoreorenvelope 

proteins. 

By incorporation into a suitable vaccine formulation, the gag/env protein of the present invention may 
further be used to combatthe spread of AIDS through prophylactic immunization. 

55 There are important advantages to using the gag/env protein of the invention, compared to using separate 55 
gag and env proteins. The env protein itself is very insoluble, which makes purification difficult. Combination 
of the two proteins results in a product that is more easily sol ublized and therefore more easily purified. 
Large-scale production and purification arealso simplified, of course,sinceonly a singiecompoundmustbe 
isolated. Finally, the gag/env protein makes possible the development of a diagnostic kit which works better 

60 in an antigen sandwich assay than would separate gag and env proteins. 

Although the gag/env protein was derived from HTLV-lll, it must be understood that the diagnostic and 
immunoprophylactic methods described herein are applicable to the detection of any of the other viral 
agents which have been implicated in or associated with AIDS such as lymphadenopathy-associated virus 
(LAV) and AIDS-associated retrovirus (ARV). This is because Crowl et al. [Cell 41: 979 (1985)] have shown that 

65 proteins from these viral agents are immunologically related to those of HTLV-lll and have amino acid 
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sequence segments in common. 

The present invention may be more readily understood on the basis of the following detailed description 
when considered in connection with the following figures, wherein 
Figure 1 is a schematic representation of expression plasmids which directs the synthesis of gag (upper 
5 drawing) and gag/env (lower drawing) proteins. Restriction endonuclease sites delineating the gag and env 5 
genes are shown, and the cross-hatched region denotes the env segment of the gag/envf usion gene. In both 
plasmids transcription is under the control of the \ P L promoter, and translation initiation signals are from 
plasmid pEV-vrf2; 

F/pure2showsthe nucleic acid sequence of the gag/env fusion gene (with selected position numbers and 
10 restriction endonuclease sites indicated) and the amino acid sequence of the gag/env protein predicted 10 
therefrom; and 

Figure3$hows the results of SDS-polyacrylamidegel electrophoretic analysis of the gag/env protein 
produced in E.coli. Panel 1 shows Coomassie blue-stained total cell proteins from cells harboring the 
recombinant plasmid carrying the gag/envfusion gene and from control cells. Panels 2 and 3 are Western 

1 5 blots of total cell protein probed with either rabbit antibodies against env peptide 500-51 1 (panel 2) or sheep 1 5 
antibodies against gag p24 (panel 3). The immune complexes in panels 2 and 3 were visualized by a second 
antibody labeled with horseradish peroxidase. In all three panels g/e refers to the gag/env protein, and c 
refers to a negative control sample used to show specificity. The mobilities of molecu la r weight standards (in 
kD) are shown, and the position of the gag/env protein band in the gels is indicated by the arrows. 

20 The methods of this invention entail a number of steps which, in logical sequence, include (1 ) identification 20 
and isolation of the genes encoding the gag and env proteins or fragments thereof, (2) insertion of these 
genes or gene fragments into an appropriate cloning vehicle to produce a recombinant vector containing a 
gag/env fusion gene, (3) transfer of the recombinant cloning vehicle into a compatible single cell host 
organism, (4) selection and growth of properly modified hosts that can replicateand express the inserted 

25 gene sequences, (5) identification and purification of the gene product, (6) use of the gene productto detect 25 
antibodies against HTLV-III or related viruses or an as immunogen to produce antibodies that can in turn be 
used to detect the viruses themselves or fragments therefrom in human sera or in other biological fluids, and 
(7) use of the gag/env protein in a vaccine formulation for possible immunoprophylactic protection against 
AIDS. 

30 Isolated HTLV-III virions could be used as the source for both the gag and the env genes of the invention. 30 
Forexample, Dowbenkoetal. [Proc. Natl. Acad. Sci. U.S.A. 52:7748 (1985)] haveuseda 2.2kilobase(kb) 
fragment from the 5' region of the viral genome as a source of the gag gene. Alternatively, genomic DNA 
from cells into which the proviral genome of HTLV-III has been integrated can be the gene source [Shaw et. 
al., Science 226: 1 165 (1 984)]. Crowl et al. [Cell 4/:979 (1 985)] have used such genomic DNAf rom:H9cells 

35 infected with HTLV-III to obtain the env gene. 35 
Alternatives to the isolation of the gag and env genes include but are not limited to the chemical synthesis 
of the gene sequences and the preparation of DNA that is complementary to the messenger RN A produced 
from the genes. 

Regardless of its source, DNA encoding the gag and env proteins can be cloned into bacteria, and clones 
40 containing the gag or env genes can be identified by methods that are well known in the art. Forexample, 40 

complimentary DNA (cDN A) probes could be prepared for the genes and used to detect clones bearing the 

genes through hybridization techniques. 
In a preferred embodiment of the present invention, a genomic library constructed by Xbal digestion of the 

DNA of the HTLV-lll-infected H9 cells described above was cloned using a phase \ vector, and clones bearing 
45 the HTLV-III proviral genome were identified by hybridization with HTLV-III cDNA. One such clone was 45 

designated \HXB-3, and a 1700 base-pair fragment encoding most of the gag precursor protein was isolated 

by Clal/Hincll digestion of the DNAfrom this clone. 
The immediate source of env gene sequences used in the preferred embodiment of the invention was a 

derivative of plasmid pEV3/env 44-640 that had env codons corresponding to residues 51 4-524 deleted. This 
50 deletion surprisingly results in a significant increase in the expression of the env gene. Env sequences 50 

containing codons 467 to 640 were obtained by the Bglll/Hmdlll cleavage of plasmid pEV3/env 44-640. 
Once identified and isolated, the gag and env genes of HTLV-III are inserted into an appropriate expression 

vehicle which contains the elements necessary for transcription and translation of the inserted gene 

sequences. Useful cloning vehicles may consist of segments of chromosomal, nonchromosomal and 
55 synthetic DNA sequences such as various known bacterial plasmids, phage DNA, com binations of plasmids 55 

and phage DNAs such as plasmids which have been modified to employ phage DNA or other expression 

control sequences, or yeast plasmids. Specific cloning vehicles which could be used include but are not 

limited to the pEV-vrf plasmids (pEV-vrf 1 ,-2 and -3), SV40, adenovirus, yeast, lambda gt-WES-lambda B, 

Charon4Aand28,Lambda-gt-1-lambda B, M13-derived vectors such as pUC8, 9, 18and 19,pBR313,322and 
60 325, pAC105, pVA51, pACY177, pKH47, pACYC184, pUB1 10, P MB9,Co1E1, pSC101, pm121,RSF2124, pCR1 60 

orRP4. 

The insertion of the gag and env genes into a cloning vector is easily accomplished when both the genes 
and the desired cloning vehicle have been cut with the same restriction enzyme or enzymes, since 
complementary DNA termini are thereby produced. If this cannot be accomplished, it may be necessary to 
65 modify the cut ends that are produced by digesting back single-stranded DNA to produce blunt ends, or by 65 
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achieving the same result by filling in the single-stranded termini with an appropriate DNA polymerase. In 
this way, blunt-end ligation with an enzyme such asT4 DNA ligase may be carried out Alternatively, any site 
desired may be produced by ligating nucleotide sequences (linkers) onto the DNA termini. Such linkers may 
comprise specific oligonucleotide sequences that encode restriction site recognition sequences. The cleaved 

5 vector and the gag and env gene fragments may also be modified by homopolymeric tailing, as described by 5 
Morrow [Methods in Enzymology 68:3 (1 979)]. 

Once either the gag orenvgene or afragment thereof has been inserted into an appropriate cloning 
vehicle, theother gene or gene fragment can be introduced into the vehicle by judicious restriction 
endonuclease cleavage to place the second gene or gene fragment into juxtaposition with the first, thereby 

10 creating a fusion gene. Of Course if thegag and env genes are chemically synthesized, thefusiongenecould 10 
be directly produced and introduced as a single DNA fragment into the cloning vehicle. 

In the preferred embodiment of the present invention, the Clal/Hincll digestion fragment containing the 
gag gene from \HXB-3 described above was ligated into plasmid pEV-vrf2 which had been cleaved withClal 
and Puvll to produce a recombinant expression plasmid pEV2/gag 15-512 (Figure 1, upper drawing) that 

1 5 directed the synthesis of a 56 Kd protein comprising gag residues 1 5-51 2. The gag sequences downstream of 1 5 
a Bglll site near residue 437 were then replaced with env sequences in a Bgl ll/Hindlll fragment from the 
derivative of plasmid pEV3/env 44-640 (having residues 514-524 deleted) described above to produce 
plasmid pEV2/gag 1 5-436/env 467-640 A 514-524 bearing the gag/envfusion gene (Figure 1 , lowerdrawing). 
Full details of this construction are provided below in the section entitled "Example", in particular in 

20 subsections B to E. The nucleic acid sequence of the gag/env gene of the preferred embodiment [determined 20 
by the chemical cleavage method of Maxam et al., Methods in Enzymology 65:499 (1 980)] and the amino acid 
sequence of the gag/env protein predicted therefrom are shown in Figure 2. 

Many of the cloning vehicles that may be used in this invention contain one or more marker activities that 
may be used to select for desired transformants, suchas ampicillin and tetracycline resistance in pBR322, 

25 ampicillin resistance and 0-galactosidase activity in pUC8, and ampicillin resistance in pEV-vrf2. Selection of 25 
host cells into which such vectors have been inserted is greatly simplified when the host cells otherwise lack 
the activities contributed by the vectors. 

It should be understood that the nucleotide sequences of the gag and env gene fragments inserted at a 
selected site in a cloning vehicle may include nucleotides which are not part of the actual structural genes. 

30 Alternatively, the gene fragments may contain only part of the complete genes. All that is required is thatthe 30 
genefragments inserted into the cloning vehicle be capable of directing the production in an appropriate 
host organism of a polypeptide or protein having at least one antigenic determinant corresponding to the 
sequences of the gag and env proteins. 
The selection of an appropriate host organism is affected by a number of factors known in the art. These 

35 factors include, for example, compatibility with the chosen vector, toxicity of proteins encoded by the hybrid 35 
plasmid, ease of recovery qf the desired protein, expression cha racteristics, biosaf ety and costs, A balance of 
these factors must be struck, and it must be understood that not all hosts will be equally effectivefor 
expression of a particular recombinant DNA molecule. 
Suitable host unicellular organisms which can be used in this invention include but are not limited to plant, 

40 mammalian oryeast cells and bacteria such as Escherichia coli, Bacillus subtilis, Bacillus stearothermophilus 40 
and Actinomyces. Escherichia coli strain MC1061 which has been described by Casadaban etal. [J. Mol. Biol. 
/3S:179 (1980)], can be used as can be used any other strain of E. coli K-1 2 containing the plasmid pRK248clts. 
Plasmid pRK248clts for use in other E. coli K-1 2 strains is f reeby available from the American TypeCulture 
Collection (ATCC) and has accession No. ATCC 33766. E. coli strain MC1 061 has also been deposited at ATCC 

45 (accession No.ATCC53338) and is also freely available upon request 45 
Transfer of the recombinant cloning vector into the host cell may be carried out in a variety of ways. 
Depending upon the particular vector/host cell system chosen, such transfer may be effected by 
transformation, transduction ortransfection. Once such a modified host cell is produced, the cell can be 
cultured and the protein expression product may be isolated from the culture. 

50 As produced in E.coli, the gag/env protein is largely confined to cytoplasmic inclusion bodies (insoluble 50 
protein aggregates) of the bacterium, a fact which greatly facilitates purification of the protein. To isolate the 
gag/env protein product of this invention, the bacterial cells are preferably disrupted or lysed and the 
insoluble protein is recovered by centrifugation. Substantial purification can then be obtained by sequential 
washing of the precipitate with increasing concentrations of urea followed by the use of standard protein 

55 purification techniques. 55 
For small amounts of material such assamplestakenforpolyacrylamidegel electrophoretic analysis, the 
cells can be disrupted by treatment with a detergent such as sodium dodecyl sulfate (SDS). Larger quantities 
of the protein are best recovered by sonication, or by other mechanically disruptive means, such as the 
French pressure cell. 

60 Cell disruption could also be accomplished by chemical or enzymatic means. Since divalent cations are 60 
often required for cell membrane integrity, treatment with appropriate chelating agents such as EDTAor 
EGTA might prove sufficiently disruptive to facilitate the leakage of the protein from the cells. Similarly, 
enzymes such as lysozyme have been used to achieve the same result That enzyme hydrolyzesthe 
peptidoglycan backbone of the cell wall. The application of alternating freezing and thawing in conjunction 

65 with lysozyme treatment could also be employed. 65 
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Once freed from the cells, the gag/env protein may be identified in the mixture by any of the methods 
known in the art. For example, a radioimmunoassay or enzyme-linked immunosorbent assay could be 
carried out using antibodies against the protein. Preferably, the protein is identified by SDS polyacrylamide 
gel electrophoresis followed by Western blot or similar analysis. 

5 The gag/env protein of the invention can be concentrated by precipitation with salts such as sodium or 5 
ammonium sulfate, by ultrafiltration or by the use of other methods well known to those skilled in the art. 
Further purification can be accomplished by conventional protein purification techniques including but not 
limited to gel filtration, ion-exchange chromatography, preparative disc-gel or curtain electrophoresis, 
isoelectric focusing, low temperature organic solvent fractionation orcountercurrent distribution. 

10 Because the gag/env protein contains antigenic determinants from two major components of the HTLV-lll 1 0 
virus, and because this virus is the principal etiologic agent in AIDS and is immunologically related to the 
other viral agents which have been implicated in or associated with AIDS or ARC, the protein is a powerful 
diagnostic tool forthe detection of antibodies against AIDS viruses in human serum. Thefusion can be used 
for this purpose in numerous ways known in the art. 

1 5 For example, the gag/env protein could be labeled in any number of ways, the labeled protein could be 15 
added to a human serum sample suspected to contain antibodies against AIDS viruses to form labeled fusion 
protein-antibody complexes, and the complexes thus formed could be detected by appropriate means. By 
way of further example, the protein could be immobilized on a solid support and then contacted with a 
human serum sample. Antibodies against AIDS viruses in the sample would bind to the immobilized protein, 

20 and the complexes thus formed could be detected by the use of a reagent such as Staphylococcus aureus 20 
protein A (labeled, e.g., with lodine-1 25) or a second anti-human IgG antibody (lableled, e.g., with a 
radioisotope or with horseradish or lactoperoxidase) after uncomplexed proteins and antibodies have been 
washed away. Many variations on these themes will be apparent to the skilled artisan, some of which are 
suggested below. 

25 Through the use of antibodies against the gag/env protein, a variety of diagnostic tests forthe presence of 25 
AIDS viruses or particles therefrom in human serum or in other biological fluids can be devised. Such 
antibodies can be produced by injecting a mammalian or avian animal with a sufficient amount of a vaccine 
formulation comprising the protein and a compatible pharmaceutical carrier to elicit the production of 
antibodies against the protein. The appropriate amount of the protein which would be required would be 

30 known to one of skill in the art or could be determined by routine experimentation. As used in connection 30 
with this invention, the term "pharmaceutical carrier" can mean either the standard compositions which are 
suitable for human administration or the typical adjuvants employed in animal vaccinations. 

Suitable adjuvants for the vaccination of animals include but are not limited to Freund's completeor 
incomplete adjuvant (not suitable for human or livestock use), Adjuvant 65 (containing peanut ojl, mannide 

35 monooleateand aluminum monostearate), and mineral gels such as aluminum hydroxide, aluminum 35 
phosphate and alum; surfactants such as hexadecylamine, octadecylamine, lysolecithin, 
dimethyldioctadecylammonium bromide, N r N-dioctadecyl-N'-N-bis(2-hydroxyethyl-propanediamine), 
methoxyhexadecylglycerol, and pluronic polyols; polyanions such as pyran, dextran sulfate, poly IC, 
poiyacrylic acid, carbopol; peptides such as muramyl dipeptide, dimethylglycine, tuftsin; and oil emulsions. 

40 The gag/env protein could also be administered following incorporation into liposomes or other 40 
microcarriers, or after conjugation to polysaccharides, other proteins or other polymers. 

Typically, the initial vaccination is followed some weeks later by one or more "booster" vaccinations, the 
net effect of which is the production of high titers of antibodies against the gag/env protein, which can be 
harvested in the usual way. 

45 Of course, monoclonal antibodies againstthe protein could be produced through current technology to 45 
achieve the same result. Somatic cells with the ability to produce antibodies, such as B cells, may befused 
with myeloma cell lines to produce hydridoma cells. These cells may be cultured in vitro or as ascites tumors 
indefinitely to produce large quantities of specific antibodies. Because hybridoma cells may be readily 
cloned, it is possible to rapidly produce large numbers of cells, all of which produce the same specific 

50 antibody molecules directed to a common antigenic determinant This exceptional uniformity in antibody 50 
production may be advantageous where the antibodies are to be used in specific diagnostic tests. 

Lymph nodes and spleens of animals of animals primed by injection of an antigen are convenient sources 
of B cells, although it is equally feasible to remove these cells from unsensitized animals, and to prime them 
in vitro after isolation. Mouse and rat B lymphocytes are most frequently used in hybridoma production, but 

55 cellsfrom rabbits, human beings, frogs orother animals might be used instead. 55 
Numerous specialized myeloma cell lines have been developed from lymphocytic tumors for use in 
hybridoma production [Kohler and Milstein, European J. Immunol. 5:51 1 (1 976); Shulman et al., Nature 
276:269 (1978)]. Of the many such cell lines produced, P3/X63-Ag 8, P3/NSI/1-Ag 4-1,Sp2/0-Ag14,and 
S194/5.XXO.BU.1 have frequently been used. 

60 Thefusion of the antibody-producing spleen or lymph node cells with myeloma cells to produce 60 
hybridomas is usually ca rried out with a n excess of splenocytes or lym phocytes over myeloma cells that may 
be as high as 20: 1 although, typically, lower ratios are used. Fusion is facilitated by the use of a 
fusion-promoting agent such as UV-inactivated Sendai virus or polyethylene glycol (PEG). Gefter et al. 
[Somatic Cell Genet. 3:231 (1 977)] have reported that combining dimethyl sulfoxide with PEG further 

65 enhances cell fusion. Electrical devices are also available which can fuse cells with an exceptionally high 65 
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degree of efficiency. 

Once fusion has occurred, the hybridoma cells must be selected from the unfused parental cell strains. This 
selection process may be readily accomplished by culturing the cells in a medium that supports hybridoma 
but not parental cell growth. The somatic B cells used in thefusion have limited lifespans in culture andthus 

5 will be lostasthey undergo sensecence and death, butthe parental myeloma cells, with indefinite culture 5 
lifespans, must be eliminated by special selection techniques. 

In a commonly used system, myeloma cells lacking hypoxanthine phosphoribosyl transferase (HPRT") are 
used. These cells lack the scavenger pathway for the reutilization of hypoxanthine free base and cannot 
survive if an inhibitor, such as aminopterin, is used to blockthe de novo purine synthetic pathways. The 

10 myeloma parental cells may thus be selected against by culturing the fusion mixturein 10 
hypoxanthine/aminopterin/thymidine (HAT) medium, while the hybridoma cells will survive dueto the 
contribution of HPRT by the antibody-producing fusion parental cells. 

After a period of selection culturing, the surviving hybridoma cells may be cloned, stocks may be grown up 
by standard cell culture methods, and clones producing desired specific immunoglobulins may be detected 

1 5 by enzyme-linked immunosorbent assay (ELISA) or by other tests, based upon the use of the antigen against 1 5 
which the antibodies are directed. 

The anti-gag/env protein antibodies obtainable through the use of this invention may further be used for 
the preparation of a variety of diagnostic tests for AIDS viruses or particles therefrom. Such diagnostic 
systems could take the form of a radioimmunoassay, either in free solution orsolid state. Alternatively, 

20 enzyme-linked immunosorbent assays could be produced, as could assays based on immunoblot analysis. 20 
These assays could be direct or indirect, with the application of a second antibody directed againstthe 
anti-gag/env protein antibodies. Numerous enzymatic activities could be coupled to the antibodies, with 
peroxidase, glucose oxidase, p-galactosidase and alkaline phosphatase being only a few of the possibilities. 
The basic principle underlying many of these tests is that a human serum or other biological fluid sample 

25 suspected to contain AIDS viruses or fragments therefrom is reacted with a known titer of antibodies against 25 
the gag/env protein to form antigen-antibody complexes. The complexes thusformed are detected by an 
appropriate means. 

Those skilled in the art will also recognize that there are many other ways in which anti-gag/env protein 
antiserum could be utilized in a diagnostic capacity, such as in one of a number of agglutination tests. In such 

30 agglutination assays, the interaction of antibodies and AIDS viruses or viral fragments may be detected using 30 
systems in which particles are coated with the anti-gag/env protein antibodies. Such particles may be latex 
beads, liposomes, erythrocytes, polyacrylamide beads, oranyofanumberof suitable polymers. 

The method forthe determination of AIDS virus or of antibodies against AIDS virus as described abovecan 
be conducted in suitable test kits comprising in a containerthe gag/env protein of the present invention or 

35 antibodies against AIDS virus elicited by the gag/env protein of the present invention. 35 
Crowl etal. [Cell 41,979-986 (1985)] have described that the serum of all of 50 AIDS patients tested with 
genetically engineered env protein was higly reactive to the protein. This was true despite the fact that half of 
these patients were from the west coast of the U nited States, and half were from the east coast. The fact that 
all of the sera tested were reactive to the env protein showed that the viruses producing the antibodies that 

40 thus reacted, although geographically separated and undoubtedly somewhat genetically different, must 40 
have had at least one common or conserved antigenic determinant in their envelope proteins. 

The gag core protein of the AIDS retrovirus has been found to induce antibody formation in a large percent- 
age of individuals who had previously been exposed to the virus [Montag nier et al., supra; Schupbach et al., 
supra; Sarngadharan etal., supra]. 

45 Taken together, the above observations suggestthat the gag/env protein of this invention, which has at 45 
least one antigenic determinant corresponding to the sequences of the gag and env proteins of which it is 
comprised, would be immunogenic in human beings and may be useful as an AIDS vaccine. 

Thus, the gag/env protein (either produced as described herein or chemically synthesized) can be used in a 
vaccine formulation comprising the protein and a compatible pharmaceutical carrier. The protein can be 

50 used as purified in such formulations, or it can be made more immunogenic by modifications known in the 50 
art For example, the protein can be converted into a highly immunogenic matrix by reaction with a cross- 
linking agent such as 1,3-dicyclohexylcarbodimide. Alternatively, the protein could be covalently coupled to 
a highly immunogenic protein carrier molecule, either directly or with an appropriate linker molecule. Carrier 
molecules which could be used include, e.g., limpet hemocyanin and various baceterial toxoids (inactive 

55 toxins) such as diphtheria toxoid. Where the gag/env protein is conjugated to a bacterial toxoid, a bivalent 55 
vaccineformuiation can be produced that will provide protection against both AIDS and the bacterium from 
which the toxoid was derived. 

Routes of administration, antigen dose, numberfrequency of injections are all factors falling within the 
ordinary skill in the art. 



60 



60 



EXAMPLE 

The following is a non-limiting example illustrating the methods by which a recombinant vector encoding 
the gag/env protein of the HTLV-IH virus can be produced. In this example, the following steps were carried 
out, each of which will be described in detail below: 
65 (DTheDNA sequence encoding residues 15-51 2 (all but the first 14 amino-terminal residues) of the gag 65 
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gene was excised from recombinant phage clone \HXB-3 and ligated into E. coli expression plasmid 
pEV-vrf2to produce plasmid pEV2/gag 15-512; 

(2) DNA sequences corresponding the env amino acid residues 31 9-331 were deleted from expression 
plasmid pEV3/env 44-640 by primer-directed mutagenesis to produce plasmid pEV/env 44-640 4> 31 9- 

5 331; 5 

(3) Primer-directed mutagenesis was carried out on plasmid pEV/env 44-640 A 31 9-331 to produce the 
deletion of nucleotides encoding amino acid residues 51 4-524 and plasmid p EV3/env 44-640 <t> 319-331 A 
514-524; and . - 

(4) A restriction endonuclease fragment of the env genefrom plasmid pEV/env 44-640 A 319-331 A 514-524 

10 encoding env residues 467-640 A 514-524 was ligated into cleaved plasmid pEV2/gag 15-512, thereby 10 
producing a fusion gene encoding gag residues 1 5-436 and env residues 467-640 A 51 4-524 in plasmid 
pEV2/gag 1 5-436/env 467-640 A 51 4-524. 
At each stage of the construction plasmidswere reproduced by transformation intoE. coli strain MCI061 
(pRK248clts). 

15 15 

A. General procedures for recombinant vector preparation 
DNA preparation 

Small scale isolation of plasmid DNA from 1 ml of saturated overnight cultures was carried out according 
totheprocedureof Birnboimetal. [Nucleic Acids Research 7:1513 (1979)]. This procedure allows the isola- 
20 tion of a small quantity of DNAfrom a bacterial colony for analytical purposes. Largeramountsof plasmid 20 
DNA were prepared using 1-liter cultures following a standard protocol with cesium chloride centrifugation. 

Conditions for enzymatic reactions 
The restriction enzymes and the T4 DNA ligase used were all products of New England BioLabs, Beverly, 
25 MA. The methods and conditions for the use of these enzymes were those published by the manufacturer. 25 
For the restriction endonucleases, a unit of activity is defined as the amount of enzyme needed to produce a 
complete digest of 1 .0 ^g DNA in 60 minutes in a total reaction volume of 0.05 ml, with digestion carried out 
at 37°C. The digestion mixtures, in addition to the DNA to be cleaved, contained 100 p,g/ml bovineserum 
albumin and the following buffer components. 
30 Bglll : 100 mM NaCI, 1 0 mM Tris-HCI (pH 7.4), 1 0 mM MgCI 2 and 1 0 mM 2-mercaptoethanol (2-ME) 30 

Clal : 50 mM NaCI, 6 mM Tris-HCI (pH 7.9) and 6 mMMgCI 2 

Hindi : 100 mM NaCI, 1 0 mM Tris-HCI (pH 7.4), 7 mM MgCI 2 and 1 mM dithiothreitol (DTT) 
35 35 

Hindlll : 50 mM NaCI, 50 mM Tris-HCI (pH 8.0) and 10 mM MgCI 2 

Hpal : 6 mM KCI, 1 0 mM Tris-HCI (pH 7.4), 1 0 mM MgCI 2 and 1 mM DTT 

40 Pstl : 100mMNaCI,10mMTris-HCI(pH7.5)and10mMMgCI 2 40 

Pvull : 60mMNaCi,6mMTris-HCI(pH7.5),6mMMgCI 2 and6mM2-ME 

Stul : 100 mM NaCI, 10 mM Tris-HCI (pH 8.0), 10 mM MgCI 2 and 6 mM 2-ME 

45 *& 

T4 DNA ligation was carried out at 1 6°C in a mixture containing the DNA and 50 mM Tris-HCI {pH 7.8), 1 0 
mM MgCI^ 20 mM DTT, 1 .0 mM ATP and 50 ^g/ml bovine serum albumin. A unit of T4 DNA ligase activity is 
defined as the amount required to give 50% ligation of Hindlll fragments of lambda DNA in 30 minutes at 1 6°C 
in 20 \l\ of incubation mixtureand a 5' DNA termini concentration of 0.12 p.M (300 jig/ml). 
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Purification ofDN A fragments in agarose 

Following restriction endonuclease cleavage, DNAfragments for cloning were isolated by electrophoresis 
in 1% agarose. After visualization by 1 jig/ml ethidium bromide, slices of the gel containing desired DNA 
5 fragments were extruded through a 21 gauge needle into 4 ml of a solution containing 10 mMTris-HCI (pH 
7.4), 1 mM EDTA and 300 mM NaCI. This mixture was then frozen for 3 hours at -80°C, thawed for 30 minutes 
at37°Cand centrifuged at 10,000 x g for 30 minutes. The supernatantfluid was filtered through a 0.45 micron 
Millex filter, concentrated to 0.25 ml with sec-butanol and ethanol precipitated three times with 1 0 \lq of E. 
coli tRNA (transfer RNA) as a carrier. 

10 

Culturemedia 

M9CA medium contained 10 g Na 2 HP0 4 , 3 g KH 2 P0 4 , 0.5 g NaCI and 1 g NH 4 CI per liter, with 1 mM MgS0 4 , 
0.5% glucose and 0.5% casamino acids, adjusted to pH 7.4. 
Luria Broth (LB) contained 5 g Bacto-yeast extract, 1 0 g Bacto-tryptone and 10 g NaCI per liter, adjusted to 

15 pH7.5. 15 
The antibiotics tetracycline and ampicillin were added to final concentrations of 1 5 and 50 p,g/ml, re- 
spectively, were indicated. 

Transformation and isolation of recombinants 
20 The transforamtion of E. coli strain MCI061 (pRK248clts) was carried out using a modification of the prot- 20 
ocol of Kushner [in Genetic Engineering, eds. Boyer et al., Elsevier/North-Holland, Amsterdam, p. 1 7 (1 978)], 
essentially as follows. 

Two-hundred ml of the bacterial cells were grown at 30°C in LB to an O.D.eoo of between 0.5 and 1 .0, after 
which the cells were collected by centrifugation at 6,000 x g f or 5 minutes at 4°C and resuspended in 50 ml of 

25 a solution containing 10 mM morpholinopropane sulfonic acid {MOPS; pH 7.0) and 10 mM RbCI. Thecells 25 
were again collected by centrifugation and resuspended in 30 ml of a solution containing 1 0 mM MOPS (pH 
6.5), 50 mMCaCI 2 and 10mM RbCI. After incubation at 0°C for 30 minutes, the eel Is were collected, resuspen- 
ded'in 6 ml of 1 0 mM MOPS (pH 6.5) with 50 mM CaCI 2 , 1 0 mM RbCI and 1 5% glycerol, aliquoted into 40 
Eppendorf tubes (300 fil/tube) and frozen at -80°C. 

30 Transformation was performed by thawing a 100 pJ aliquot of the cells and incubating for 30, 2 and 2 30 
minutes at 0°C, 37° and 0°C, respectively in the presence of a 10 \l\ sample of ligation mixture containing about 
lOOng of DNA and adding 0.1 to 0.5 ml of LB to the tube. The tube was then incubated at 22°C for 60 minutes, 
after which 200 \l\ of the cell suspension were spread onto an LB plate containing ampicillin and incubated at 
30°C for 20 hours. 

35 35 
Ceif growth and induction of gene expression 

Cultures of E. coli MCI061 containing plasmid pRK248clts and an expression plasmid were grown in M9CA 
medium to mid-log phase at 30°C and then transferred to 42°C to inactivate the X P L repressor. After a 2 hour 
incubation at 42°C, cells from 1 ml of the induced culture were collected by centrifugation and resuspended in 

40 TG buffer containing 1 0 mM Tris-HCI (pH 7.4) and 1 0% glycerol in preparation for analysis. 40 

SDSpofyacryfamidegei electrophoresis and western bfot analysis 

Induced cells resuspended in Tris-glycine (TG) buffer were mixed with an equal volume of 2x sample buffer 
(Laemmli, Nature 227:680 (1970)], incubated at 95°Cfor 5 minutes and subjected to SDS polyacryl amide gel 
45 electrophoresis in 1 2.5% gels as described by Laemmli. Following electrophoresis, the proteins from the gel 45 
W ere electroblotted onto a 0.1 micron nitrocellulose membrane for 6 hours at 95 volts in 1 2.5 mM Tris and 96 
mM glycine with 20% methanol and 0.01 % SDS at pH 7.5. Western blot analysis was then carried out as 
described by Towbin et al. [Proc. Natl. Acad. Sci. U.SA 76:4350 (1979)]. 

KfJ. 

50 Primer-directed mutagenesis 

Primer-directed site specific mutagenesis was performed according to the methods described by 
Morinaga et al. [Biotechnology 2:636 (1984)]. Synthetic oligonucleotides used to carry outthe mutagenesis 
procedure were prepared by phosphite methodology [Beaucage et al., Tetrahedron Lett. 22: 1859 (1981 ); 
Matteucci et al., J. Am. Chem. Soc. 103: 31 85 (1 981 )] using an automated synthesizer and purified by poly- 

55 acrylamide gel electrophoresis by the method of Maxam et al., [Methods in Enzymology 65:499 (1 980)]. 55 

Co/ony hybridizations CA 
Colony hybridizations were performed by the method of Grunstein et al. [Proc. Natl. Acad. Sci, U.&.A. 
72*3961 (1 975)]. The same oligonucleotide used for primer-directed site specific mutagenesis was used as a 
60 probe forthe hybridizations after 5' end labeling with ^P-t-ATP using polynucleotide kinase as described 60 
byManiatisetal. [Cell 15:687(1978)]. 

^.Construction of plasmid pEV2l gag 15-512 
As noted above, constniction of a finals 
65 initiatedbyconstmctionofaplasm 65 
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amino-terminal residues. Some of the nucleotides encoding the carboxyl terminal gag residues were then 
replaced by env sequences to obtain the final fusion product. 

To make plasmid pEV2/gag 1 5-51 2, the DN A sequence encoding residues 1 5-51 2 o f the gag protein was 
obtained from 50>g of recombinant phage lambda clone \HXB-3 containing the HTLV-III proviral genome by 
5 cleavage with 50 units of Clal and 50 units of Hindi for 120 minutes at 37°Cto produce a 1700 bpfragment 5 
This DNA fragment was isolated by preparative agarose gel electrophoresis and resuspended in 0.1xTE 
buffer [1 mM Tris-HCi (pH 7.4) with 0.1 mM EDTA] to a final concentration of 0.03 pmole/pJ. 

To receive the gag DNA fragment, 2 jjig of the E. coli expression plasmid pEV-vrf2 were cleaved with 5 units 
of Clal and5units of Pvullfor 60 minutes at 37 °C, and a 1700 bp fragment containing the XP L promoter was 
10 isolated by electrophoresis in 1% agarose gel and resuspended after ethanol precipitation to a final concentr- 10. 
ationof0.03 pmole/Ml 

Plasmid pEV-vrf2 used in this construction is a derivative of readily available plasmid pBR322 [ATCC 
31 344]. The construction of plasmid pEV-vrf2 has been described in detail by Crawl etal. [Gene 35:31 (1985)]. 
Recombinant phage lambda clone XHXB-3 used in this construction has been described by Shaw et al. [Sci- 

1 5 ence 226: 1 1 65: (1 984)]. This clone was obtained using standard recombinant DNA techniques from the 1 5 

HTLV-III infected H-9cell line described above. An HTLV-III infected human cell line designated H9/HTLV-IIIB 
which is suitable for use in this invention has been deposited with the American Type Culture Collection and 
assigned accession No. CRL8543. Since Shaw etal. have published the sequence of the HTLV-III genome, a 
DNA probe can easily be constructed using those data which can be used to isolate the genome from H-9 or 

20 from any other line capable of propagating the virus. 20 
Three-hundredths (300) pmole of the gag/Clal-Hincil fragment were mixed with 0.03 pmole of the pEV-vrf2/ 
Clal-Pvull fragment and ligated with 200 units of T4 DNA ligase for 18 hours at 1 5°C in a final volume of 10 u,l. 
The ligation product was then transformed into E. coli strain MCI061 (pRK248clts), and transformants were 
selected by growth on LB agar plates with ampicillin after incubation at 30°C for 18 hours. The recombinant 

25 plasmid DNAs were analyzed for correct restriction fragment size following cleavage with Bglll.Pstlor 25 
Hindll I. Isolates positive by DNA size analysis were then evaluated for synthesis of the gag precursor poly- 
peptide by SDS polyacrylamidegel electrophoresis and Western blot analysis. 

Cultures harboring plasmid pEV2/gag 1 5-512 were induced at 40°C, and duplicate samples of the induced 
cells were prepared for analysis by SDS polyacryiamide gel electrophoresis. After electrophoresis, the gel 

30 was bisected, and half of the gel was stained with Coomassie brilliant blue, while the other side of the gel was 30 
subjected to electroblotting and Westerm blot analysis using anti-gag antibodies. 

The analysis showed that the induced cells produced proteins which migrated as two major bands in the 
gels having apparent molecular weights of about 56 and 53 Kd. The size of the complete gag 1 5-51 2 protein is 
about 56 Kd. Both of the major proteins reacted with the anti-gag antibodies. A number of minor lower 

35 molecularweight proteins which were reactive with the antibodies were also observed. 35 

C. Construction of plasmid pEV3lenv 44-640 A319-331 

The env sequences present in the final exemplary plasmid of this invention were processed in two stages 
(described in this section and in Section D, below), each of which involved the deletion of specific short 

40 nucleotide sequences by primer-directed sitespecific mutagenesis. The first of these deletions (described in 40 
this section) was required to produce plasmid pEV3/env 44-640 A 31 9-331 A 51 4-524 (Section D). 

The expression plasmid pEV3/env 44-640, the construction of which from XHXB-3 and the pEV-vrf plasmids 
has been described by Crowl etal. [Cell 47:979 (1985)], directs the synthesis of a 68 Kd HTLV-III em/specific 
protein in E. coli. DNA sequences corresponding to env amino acid residues 31 9-331 were del eted by primer- 

45 directed mutagenesis using an 1 8-mer synthetic oligonucleotide having the sequence 5' GCA TTT GTT AAC 45 
ATT AGT3'. This oligonucleotide was designed to complement env sequences so as to join nucleotides 
encoding residue 318 to those encoding residue 332. As a result of bringing these sequences together, a new 
unique Hpal site was introduced into the plasmid DNA, while 39 bp were deleted. The Stul and Hindlll sites in 
plasmid pEV3/env 44-640 were used to form the "gap" for the heteroduplex molecule. 

50 DNA from isolated colonies which were positive for hybridization to the ^P-l a beled synthetic oli- 50 
gonucleotide during colony screening were transformed into MCI061 (pRK248clts) and analyzed forthe pres- 
ence of the new, unique Hpal site. 

D. Construct/on of plasmid pEV3lenv 44-640 A3 19-331 A 514-524 

55 A second deletion within the env coding sequences was created by primer directed mutagenesis using a 55 
synthetic oligonucleotide having the sequence 5' AGA GCA GTG GCA GCA GGA 3\ This oligonucleotide 
positioned sequences encoding residue 51 3 of the env protein adjacent to those encoding residue 525, 
thereby facilitating the deletion of the nucleotides encoding residues 51 4-524. This deletion was produced in 
the plasmid using the restriction sites Hpal and Hindlll to create a single-stranded gap within the env region. 

60 After re-transformation, DNAfrom positive isolates identified by the ^P-labeled oligonucleotide probe was 60 
analyzed forthe 33 bp deletion by restriction cleavage with Hpal and Hindlll followed by electrophoresis in 
1 % agarose. This deletion was also confirmed by DNA sequencing following the chemical cleavage method 
of Maxam and Gilbert [Methods in Enzymology £5:499 (1 980)]. 
As previously noted, the A 514-524 deletion produces a significant increase in the expression of env. 
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E. Construction ofplasmidpEV2lgag 15-436/env 467-640 A 514-524 

Two fig of pEV2/gag 1 5-51 2 were doubly digested with either 5 units of Clal and 4 units of Bg1 II to produce 
about a 1300 bp fragment encoding gag residues 15-436 or with 10 units of Pstl and 5 units of Clal to produce 
abouta 1 Kb fragment containing the XP L promoter. Three-tenths jjug of pEV3/env44-640A319-331 A514-524 
5 were restricted with 1 0 units of Pstl and 4 units of Bg1 II to produce about a 4 Kb fragment encoding env 5 
residues 467-640 A 514-524. The various DN A fragments thus produced were separated in 1 % agarose and 
recovered as previously described. 

Three-hundredths pmole of each of the isolated DNA fragments were mixed and ligated for 18 hours at 
1 5°C in the presence of 200 units of T4 DNA ligase. The products of the ligation mixture were then trans- 
10 formed into MCI061 (pRK248clts) f and transformants were selected on LB agar plates with ampicillin. 10 
Plasmid DNA was prepared from unique colonies and analyzed for correct restriction fragment size following 
digestion with Pvull. 

Eleven of 1 2 isolates were positive by DNA analysis for the expected 3505 and 2882 bp Pvull fragments. 
Two positive isolates were further evaluated for synthesis of the gag/env fusion protein. These cultures were 

15 induced,and cell samples were taken for analysis by SDSpolyacrylamide gel electrophoresis, electroblotting 15 
and Western blots, with the results shown in Figure 3. 

Panel 1 in Figure 3 shows Coomassie blue-stained total cell proteins from cells harboring the recombinant 
plasmid carrying the gag/envfusion gene (g/e) and from control cells (c). Panels 2 and 3 showthe results of 
duplicate samples subjected to Western blot analysis using rabbit polyclonal antibodies against a synthetic 

20 peptide corresponding to env residues 500-51 1 (panel 2) or using sheep polyclonal antibodies against gag 20 
protein p24 which is commercially available [panel 3; see Dowbenko et al., Proc Natl . Acad. Sci. U.S.A. 
52:7748 (1985) for description of p24 protein]. In Figure 3, the immune complexes in panels 2 and 3 were 
visualized by a second antibody labeled with horseradish peroxidase, and the mobilities of molecular weight 
standards (in Kd) are indicated. The positions of the gag/env protein bands in the gels are denoted by arrows. 

25 Many modifications and variations of this invention may be made without departing from its spirit and 25 
scope, as will become a ppa rent to thos skilled in the art. For example, nucleic acid substitutions could be 
made in the gene coding for the gag/env protein of the invention (using the sequence data contained herein 
as a reference) and a somewhat different (but functionally equivalent) fusion protein could be produced 
therefrom. Such an altered product would still be encompassed by this invention as long as it had at least one 

30 antigenic determinant corresponding to the sequences of a gag and env protein of an HTLV-lll virus. Sim- 30 
ilarly, the amino acid sequence of the protein itself could be directly manipulated, with the same result. The 
specific embodiments described herein are offered by way of example only, and the invention is limited only 
by the terms of theappended claims. 

35 CLAIMS 35 

1 . A protein having an amino acid sequence corresponding to at least one antigenic determinant of a 
HTLV-lll gag protein and of a HTLV-lll env protein. 

2. The protein according to claim 1 of the amino acid sequence 
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or a functional part of equivalent thereof. 

3. A protein according to claim 1 or 2 in essentially pure form. 

4. A DNA sequence coding for a protein as claimed in any one of claims 1 to 3. 

20 5. The DNA sequence according to claim 4 in which the DNA sequence comprises all or part of the nucleic 20 
acid sequence 

ATGAATAGAA TTCGGATCCA TCGATGGGAA AAAATTCGGT TAAGGCCAGG 

GGGAAAGAAA AAATATAAAT TAAAACATAT AGTATGGGCA AGCAGGGAGC 
25 TAGAACGATT CGCAGTTAAT CCTGGCCTGT TAGAAACATC AGAAGGCTGT 25 

AGACAAATAC TGGGACAGCT ACAACCATCC CTTCAGACAG GATCAGAAGA 

ACTTAGATCA TTATATAATA CAGTAGCAAC CCTCTATTGT GTGCATCAAA 

GGATAGAGAT AAAAGACACC AAGGAAGCTT TAGACAAGAT AGAGGAAGAG 

CAAAACAAAA GTAAGAAAAA AGCACAGCAA GCAGCAGCTG ACACAGGACA 
30 CAGCAGTCAG GTCAGCCAAA ATTACCCTAT AGTGCAGAAC ATCCAGGGGG 30 

AAATGGTACA TCAGGCCATA TCACCTAGAA CTTTAAATGC ATGGGTAAAA 

GTAGTAGAAG AGAAGGCTTT CAGCCCAGAA GTAATACCCA TGTTTTCAGC 

ATTATCAGAA GGAGCCACCC CACAAGATTT AAACACCATG CTAAACACAG 

TGGGGGGACA TCAAGCAGCC ATGCAAATGT TAAAAGAGAC CATCAATGAG 
35 GAAGCTGCAG AATGGGATAG AGTACATCCA GTGCATGCAG GGCCTATTGC 35 

ACCAGGCCAG ATGAGAGAAC CAAGGGAAG TGACATAGCA GGAACTACTA 

GTACCCTTCA GGAACAAATA GGATGGATGA CAAATAATCC ACCTATCCCA 

GTAGGAGAAA TTTATAAAAG ATGGATAATC CTGGGATTAA ATAAAATAGT 

AAGAATGTAT AGCCCTACCA GCATTCTGGA CATAAGACAA GGACCAAAAG 
40 AACCTTTTAG AGACTATGTA GACCGGTTCT ATAAAACTCT AAGAGCCGAG 40 

CAAGCTTCAC AGGAGGTAAA AAATTG GATG ACAGAAACCT TGTTGGTCCA 

AAATGCGAAC CCAGATTGTA AGACTATTTT AAAAGCATTG GGACCAGCGG 

CTACACTAGA AGAAATGATG ACAGCATGTC AGGGAGTAGG AGGACCCGGC 

CATAAGGCAA GAGTTTTGGC TGAAGCAATG AGCCAAGTAA CAAATACAGC 
45 TACCATAATG ATGCAGAGAG GCAATTTTAG GAACCAAAGA AAGATGGTTA 45 

AGTGTTTCAA TTGTGGCAAA GAAGGGCACA CAGCCAGAAA TTGCAGGGCC 

CCTAGGAAAA AGGGCTGTTG GAAATGTGGA AAGGAAGGAC ACCAAATGAA 

AGATTGTACT GAGAGACAGG CTAAI I I I I I AGGGAAGATC TTCAGACCTG 

GAGGAGGAGA TATGAGGGAC AATTGGAGAA GTGAATTATA TAAATATAAA 
50 GTAGTAAAAA TTGAACCATT AGGAGTAGCA CCCACCAAGG CAAAGAGAAG 50 

AGTGGTGCAG AGAGmAAAAA GAGCAGTGGC AGCAGGAAGC ACTATGGGCG 

CAGCGTCAAT GACGCTGACG GTACAGGCCA GACAATTATT GTCTGGTATA 

GTGCAGCAGC AGAACAATTT GCTGAGGGCT ATTGAGGCGC AACAGCATCT 

GTTGCAACTC ACAGTCTGGG GCATCAAGCA GCTCCAGGCA AGAATCCTGG 
55 CTGTGGAAAG ATACCTAAAG GATCAACAGC TCCTGGGGAT TTGGGGTTGC 55 

TCTGGAAAAC TACTTTGCAC CACTGCTGTG CCTTGGAATG CTAGTTGGAG 

TAATAAATCT CTGGAACAGA TTTGGAATCA CACGACGTGG ATGGAGTGGG 

ACAGAGAAAT TAACAATTAC ACAAGCTTTA ATGCGGTAGT TTATCACAGT 

TAA 

60 60 
or a functional equivalent thereof. 

6. A recombinant vector comprising a DNA sequence according to claim 4 or 5 capable of directing ex- 
pression of said DNA sequence in a compatible unicellular host organism. 

7. A recombinant vector according to claim 6 which is plasmid pEV2/gag15-436/env 467-640 A 514-524. 

65 8. A unicellular organism containing a recombinant vector as claimed in claim 6 or7. 65 
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9. A unicellular organism according to claim 8 which is a procaryotic cell. 

1 0. A unicellular organism according to claim 9 which is an Escherichia coli cell. 

11. A unicellularorganism according to claim 10 which is Escherichia coli MC 1061 cell. 

12. A unicellular organism according to claim 8 which is a eucaryotic cell. 

5 1 3. A protein according to any one of claims 1 to 3 as constituent of a vaccine. 5 

14. A protein according to any one of claims 1 to 3 as antigen. 

15. A process for producing a protein as claimed in anyone of claims 1 to 3, comprising 

(a) culturing a unicellularorganism containing a recombinant vector as claimed in claim 6 or7 under 
appropriate conditions of growth so that said protein is expressed; and 
10 (b) isolating said protein from the culture and, if desired, purifying it. 10 

16. A process according to claim 15 wherein the unicellular organism is an E. coli cell. 

1 7. A process according to claim 1 6 wherein the recombinant vector is plasmid pEV2/gag 1 5-436/env 
467-640 A 51 4-524. 

18. A process for preparing a unicellularorganism as claimed in any one of claims 8to 12, characterized 

15 by introduction of a recombinant vector as claimed in claim 6 or 7 into a unicellularorganism. 15 

1 9. A process according to claim 1 8 in which the recombinant vector is introduced into the unicellular 
organism by transformation. 

20. A process according to claim 1 8 in which the recombinant vector is introduced into the unicellular 
organism by transduction. 

20 21 . A process according to claim 1 8 in which the recombinant vector is introduced into the unicellular 20 
organism by transfection. 

22. A process for detecting the presence of antibodies against AIDS viruses in human serum, comprising : 

(a) labeling a protein of any one of claims 1-3; 

(b) reacting the labeled protein of step (a) with a human serum sample and, allowing labeled protein- 

25 antibody complexes to form inthe reaction mixture; and 25 

(c) determining the labeled protein-antibody complexes of step (b). 

23. A process for detecting the presence of antibodies against AIDS viruses in human serum, comprising: 

(a) immobilizing a protein of claims 1-3 on a solid support; 

(b) contacting a human serum sample with the immobilized protein of step (a) and allowing immobilized 

30 protein-antibody complexes to form; 30 

(c) washing away unbound protein and antibodies from the complexes of step (b); and 

(d) determining such complexes by the addition of a reagent selected from the group consisting of labeled 
Staphylococcus aureus protein A and a labeled second anti-human IgG antibody. 

24. A processfor detecting the presenceof AIDS viruses or fragments therefrom in human serum orother 

35 biological fluid, comprising: 35 

(a) reacting a human serum or other biological fluid sample with a known titer of antibodies raised against 
a protein as claimed in any one of claims 1 to 3; 

(b) allowing the antibodies and sample to interact to form antigen-antibody complexes in the reaction 
mixture; and 

40 (c)detectingtheantigen-antibodycomplexesofstep(b). 40 

25. The process according to claim 24 in which the antibodies are enzyme-labeled, and the antigen- 
antibody complexes formed in the reaction mixture are detected by enzyme-linked immunosorbent assay. 

26. The process according to claim 24 in which a known amount of a protein of claims 1 -3 labeled with a 
radioactive label is added to the serum or other biological fluid sample, and the antigen-antibody complexes 

45 formed in the reaction mixture are detected by radioimmunoassay. 45 

27. A processforthe preparatopm of a vaccine comprising mixing a protein of claims 1-3 with a phys- 
iologically acceptable carrier. 

28. A processforthe preparation of antibodies against AIDS viruses comprising injecting into a mam- 
malian oravian animal a sufficient amount of a protein of claims 1-3 to elicit the production of antibodies 

50 against said protein and recovering said antibodies from the serum of said animals. 50 

29. A vaccine formulation comprising a protein of claims 1-3 and a compatible pharmaceutical carrier. 

30. Antibodies raised against a protein according to claims 1 -3. 

31 . The antibodies of claim 30 which are monoclonal antibodies. 

32. The use of a protein according to any one of claims 1 to 3 for the preparation of a protective immuniza- 

55 tion vaccine. 55 

33. The use of a protein according to any one of claims 1 to 3 for the preparation of antibodies against 

AIDS viruses. 

34. The use of a protein according to any one of claims 1 to 3 for detecting the presence of antibodies 
against AIDS viruses in human serum. 

60 35. A protein as claimed in any one of claims 1 to 3 whenever prepared by a process as claimed in claim 60 

15. 

36. A unicellu I ar organism as claimed in any one of claims 8 to 1 2 whenever prepared by a process as 
claimed in any one of claims 1 8 to 21 . 

37. Antibodiesasclaimedinclaim30or31 whenever pre pa red by a process as claimed in claim 28. 

65 38. A vaccine as claimed in claim 29 whenever prepared by a process as claimed in claim 27. 65 
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39. Atestkitforthedeterminationofa^ 
according to any one of claims 1 -3. 

40. A test kitfor the determination of A^ 

elicited by a protein according to any one of claims 1 -3. 
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